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, Abstract 

^ I We present two algorithms for maintaining the topological order of a directed acyclic graph with n 

O ' vertices, under an online edge insertion sequence of m edges. Efficient algorithms for online topological 

\ ordering have many applications, including online cycle detection, which is to discover the first edge that 

introduces a cycle under an arbitrary sequence of edge insertions in a directed graph. The current fastest 
algorithms for the online topological ordering problem run in time (9(min(m^/^logn,OT^/^ + n^logn)) 
and (9(n^^^) (the latter algorithm is faster for dense graphs, i.e., when m > n 11/*'). In this paper we 
, present faster algorithms for this problem. 

' We first present a simple algorithm with running time 0{rv'l^) for the online topological ordering 

C/3 \ problem. This is the current fastest algorithm for this problem on dense graphs, i.e., when m > rr'/^. 

^ O ^ ■ We then present an algorithm with running time 0{{m + n\ogn)y/m), which is an improvement over the 

0(min(m^/^logn,m^/^ + n^log«)) algorithm - it is ^strict improvement whenmis sandwiched between 
a)(n) and (9(n^/^). Our results yield an improved upper bound of (9(min(n^/^, (m + «logn)-ym)) for the 
online topological ordering problem. 

in 

^ ! 1 Introduction 

Let G = {V,E) he a. directed acyclic graph (DAG) with |V| = « and \E\ = m. In a topological ordering, 
I each vertex v S V is associated with a value ord(v) such that for each directed edge (m,v) e £ we have 

^ ' ord(M) < ord(v). When the graph G is known in advance (i.e., in an offline setting), there exist well-known 

algorithms to compute a topological ordering of G in 0{m + n) time [4]. 
^ \ In the online topological ordering problem, the edges of the graph G are not known in advance but are 

^ I given one at a time. We are asked to maintain a topological ordering of G under these edge insertions. That 

is, each time an edge is added to G, we are required to update the function ord so that for all the edges 
(m,v) in G, it holds that ord(M) < ord(v). The naive way of maintaining an online topological order, which 
is to compute the order each time from scratch with the offline algorithm, takes 0{m^ + mn) time. However 
such an algorithm is too slow when the number of edges, m, is large. Faster algorithms are known for this 
problem (see Section 1.1). We show the following results here^. 

Theorem 1 An online topological ordering of a directed acyclic graph G on n vertices, under a sequence 
of arbitrary edge insertions, can be computed in time 0{n^/^), independent of the number of edges inserted. 

Theorem 2 An online topological ordering of a directed acyclic graph Gonn vertices, under any sequence 
of insertions ofm edges, can be computed in time 0{{m + n\ogn)^/m). 

The online topological ordering problem has several applications and efficient algorithms for this prob- 
lem are used in online cycle detection routine in pointer analysis [11] and in incremental evaluation of 
computational circuits [2]. This problem has also been studied in the context of compilation [8, 9] where 
dependencies between modules are maintained to reduce the amount of recompilation performed when an 
update occurs. 



^Here and in the rest of the paper, we make the usual assumption that m is Q.{n) and there are no parallel edges, so m is 0(n^ 
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1.1 Previous Results 



The online topological ordering problem has been well-studied. Marchetti-Spaccamela et al. [7] gave an 
algorithm that can insert m edges in 0{mn) time. Alpern et al. [2] proposed a different algorithm which runs 
in 0(11 8 II log II 8 II) time per edge insertion with || 8 || measuring the number of edges of the minimal vertex 
subgraph that needs to be updated. However, note that not all the edges of this subgraph need to be updated 
and hence even || 8 || time per insertion is not optimal. Katriel and Bodlaender [6] analyzed a variant of the 
algorithm in [2] and obtained an upper bound of 0(min{m^/^log«, m^/^ + «^log?i}) for a general DAG. 
In addition, they show that their algorithm runs in time 0{mklog^ n) for a DAG for which the underlying 
undirected graph has a treewidth of k, and they show an optimal running time of O(nlogn) for trees. Pearce 
and Kelly [10] present an algorithm that empirically performs very well on sparse random DAGs, although 
its worst case running time is inferior to [6]. 

Ajwani et al. [1] gave the first o{n^) algorithm for the online topological ordering problem. They 
propose a simple algorithm that works in time 0(n^■'^^^/logn) and 0{n^) space, thereby improving upon the 
algorithm [6] for dense DAGs. With some simple modifications in their data structure, they get 0{rp-^^) 
time and C?(?i^^^) space. They also demonstrate empirically that their algorithm outperforms the algorithms 
in [10, 7, 2] on a certain class of hard sequence of edge insertions. 

The only non-trivial lower bound for online topological ordering is due to Ramalingam and Reps [12], 
who showed that an adversary can force any algorithm to perform n(nlogn) vertex relabeling operations 
while inserting n — l edges (creating a chain). There is a large gap between the lower bound of ^{m+nlogn) 
and the upper bound of 0(min{n^-'^, m^/^logn, m^/^ +n^logn}). 

Our Results. The contributions of our paper are as follows: 

• Theorem 1 shows an upper bound of 0{rv'l^) for the online topological ordering problem. This is 
always better than the previous best upper bound of 0{rP"'^^) in [1] for dense graphs. Our 0{n^^^) 
algorithm is the current fastest algorithm for online topological ordering when m > n^l^. 

• Theorem 2 shows another improved upper bound of 0{{m + n\ogn)y/m). This improves upon the 
bounds of 0{nr'l^\ogn) and 0{nr'l^ + n^\ogn) given in [6]. Note that this is a strict improvement 
over 0(min(m^/^logn, (m^/^ -|-n^logn))) when m is sandwiched between co(n) and 0(n^l^). 

Combining our two algorithms, we have an improved upper bound of 0(min(n^/-^, {m+n\ogn)^/m)) for the 
online topological ordering problem. 

Our 0(n^/^) algorithm is very simple and basically involves traversing successive locations of an array 
and checking the entries of the adjacency matrix; it uses no special data structures and is easy to implement, 
so it would be an efficient online cycle detection subroutine in practice also. The tricky part here is showing 
the bound on its running time and we use a result from [1] in its analysis. Our 0{m + n\ogn)y/m) algorithm 
is an adaptation of the Katriel-Bodlaender algorithm in [6] (in turn based on the algorithm in [2]) and uses 
the Dietz-Sleator ordered list data structure and Fibonacci heaps. 

Organization of the paper. In Section 2 we present our 0{rr'/'^) algorithm and show its correctness. We 
analyze its running time in Section 3. We present our 0{{m + n\ogn)^/m) algorithm and its analysis in 
Section 4. The missing details are given in the Appendix. 

2 The 0(«V2) Algorithm 

We have a directed acyclic graph G on vertex set V . In this section we present an algorithm that maintains a 
bijection ord : V ^ {1,2, . . . ,«} which is our topological ordering. Let us assume that the graph is initially 
the empty graph and so any bijection from y to {1,2, ...,«} is a valid topological ordering of V at the onset. 
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New edges get inserted to this graph and after each edge insertion, we want to update the current bijection 
from y to {1,2, ...,«} to a valid topological ordering. 

Let the function ord from V onto {1,2, . . . ,«} denote our topological ordering. We also have the inverse 
function of ord stored as an array A[l..n], where A [j] is the vertex x such that ord(x) = i. We have the n x n 
0-1 adjacency matrix M of the directed acycUc graph, corresponding to the edges inserted so far; M[x,y] = 1 
if and only if there is an edge directed from vertex x to vertex y. 

When a new edge (m,v) is added to G, there are two cases: (i) either ord(M) < ord(v) in which case the 
current ordering ord is still a vaUd ordering, so we need to do nothing, except set the entry M[m,v] to 1, or 
(ii) ord{u) > ord(v) in which case we need to update ord. We now present our algorithm to update ord, when 
an edge (m,v) such that ord(M) > ord(v), is added to G. If (m,v) creates a cycle, the algorithm quits; else it 
updates ord to a valid topological ordering. 

[Let s -^t indicate that there is a directed path (perhaps, of length 0) from vertex s to vertex ? in G. If 
J we say j is an ancestor of t and ? is a descendant of s. We use s ^tto indicate that {s,t) G E.] 

2.1 Our algorithm for inserting a new edge (m, v) where ord (m) > ord (v) 

Let ord(v) = / and ord(M) = j. Our algorithm works only on the subarray A[/..j] and computes a subset (call 
it ANC) of ancestors of u in A[i..j] and a subset (call it DES) of descendants of v in A[/..j]; it assigns new 
positions in A to the vertices in ANC U DES. This yields the new topological ordering ord. 

We describe our algorithm in two phases: Phase 1 and Phase 2. In Phase 1 we construct the sets: 

DES = {y - i < ord(y) < t and v -^y} (the set of descendants of v in the subarray A [/..?]) 
ANC = {w :t < ord(H') < j and w u} (the set of ancestors of u in the subarray A[?..7]) 

where ? is a number such that i <t < j and t has the following property: ifG is a DAG, then the number of 
descendants ofv in A[i..t] is exactly equal to the number of ancestors ofu inA[(t + l)..j]. 

We then check if (x,y) e E for any x e DES and y G ANC, or if A [f] G DES n ANC. If either of these is 
true, then the edge (m,v) creates a cycle and G is no longer a DAG, so our algorithm quits. Else, we delete 
the elements of DES U ANC from their locations in A, thus creating empty locations in A. 

Phase 2 deals with inserting elements of ANC in A[/../] and the elements of DES in A[(? + l)..j]. Note 
that we caimot place the vertices in ANC straightaway in the empty locations previously occupied by DES 
in A[i..t] since there might be further ancestors of elements of ANC in A[/..?]. Similarly, there might be 
descendants of elements of DES in A[(? + Hence we need Phase 2 to add more elements to ANC and 

to DES, and to insert elements of ANC in A[/..f] and those of DES in A[(/ + l)..j] correctly. 

2.1.1 Phase 1. 

We now describe Phase 1 of our algorithm in detail. Initially the set ANC = {u} and the set DES = {v}. 
We maintain ANC and DES as queues. We move a pointer LeftPtr from location j leftwards (towards i 
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Figure 1: The pointer LeftPtr and RightPtr meet at the location t. 

as shown in Figure 1) in order to find an ancestor w of u, which gets added to the end of the queue ANC. 
Then we move a pointer RightPtr from location i rightwards to find a descendant y of v, which gets added 
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to the end of DES. Then we go back to LeftPtr, and thus interleave adding a vertex to ANC with adding a 
vertex to DES so that we balance the size of ANC constructed so far with the size of DES. When LeftPtr 
and RightPtr meet, that defines our desired location t. We present the detailed algorithm for Phase 1 as 
Algorithm 2.1. [If (x,y) G E, we say x is a predecessor of y and y is a successor of x.] 

Algorithm 2.1 Our algorithm to construct the sets ANC and DES in Phase 1. 

- Initialize ANC = {u} and DES = {v}. 

- set RightPtr = i and LeftPtr = j. {So LeftPtr > RightPtr.} 
while TRUE do 

LeftPtr = LeftPtr - 1; 

while LeftPtr > RightPtr and A[LeftPtr] is not a predecessor of any vertex in ANC do 

LeftPtr = LeftPtr- 1; 
end while 

if A[LeftPtr] is a predecessor of some vertex in ANC then 

- Insert the vertex A[LeftPtr] to the queue ANC. 
end if 

if LeftPtr = RightPtr then 

break {this makes the algorithm break the while TRUE loop} 
end if 

{Now the symmetric process from the side of v.} 
RightPtr = RightPtr+1; 

while RightPtr < LeftPtr and A[RightPtr] is not a successor of any vertex in DES do 

RightPtr = RightPtr +1; 
end while 

if A [RightPtr] is a successor of some vertex in DES then 

- Insert the vertex A [RightPtr] to the queue DES. 
end if 

if RightPtr = LeftPtr then 
brealt 

end if{If the algorithm does not break the while TRUE loop, then LeftPtr > RightPtr.} 
end while 



The above algorithm terminates when LeftPtr = RightPtr is satisfied. Set / to be this location: that is, 
t = RightPtr = LeftPtr. It is easy to check that Algorithm 2.1 constructs the sets: 

DES = {y'.i< ord(y) < / and v ^ y}; ANC = {w.t < ord(w) < j and w u}. 

That is, DES is the set of descendants of v in A[i..t] and ANC is the set of ancestors of m in A[?..y]. The 
following lemma is straightforward. 

Lemma 1 If the new edge (m,v) creates a cycle, then (i) either A[t] G DES PI ANC or (ii) there is some 
X G DES and y G ANC such that there is an edge from x to y. 

Proof: If a cycle is created by the insertion of the edge (m,v), then it implies that v m in the current 
graph. That is, there is a directed path p in the graph from v to u, before (m,v) was inserted. This implies 
that there is either an element inDESnANCor there is an edge in p that connects a descendant of v in A [/ . ./] 
to an ancestor of u vaA[t..]\ (see Figure 2). 

Hence either there is an element in DES n ANC, the only such element can be A[f], or there has to be an 
edge from a vertex in DES to a vertex in ANC, where DES is the set of descendants of v in A[j..?] and ANC 
is the set of ancestors of u in A[/..j]. I 
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The path p from v to m 



Figure 2: There has to be an edge in p from a descendant of v in A[i..t] to an ancestor of u in A[?..j] or 
A[?] e ANCnDES. 

Lemma 1 shows us that cycle detection is easy, so let us assume henceforth that the edge (m,v) does 
not create a cycle, hence the graph G is still a DAG. The following lemma shows that the number t has the 
desired property that we were looking for. 

Lemma 2 The number t has the property that the number of descendants of v in A[i..t] is equal to the 
number of ancestors ofu inA[{t + 

Proof: We have t = RightPtr = LeftPtr. When we terminate the while loop, ANC is the set of ancestors 
of M in A . . j] and D ES is the set of descendants of v from A . . j] . Since (w, v) does not create a cycle, A [t] is 
either a descendant of v or an ancestor of u, but not both. 

If the main while loop got broken because of the first break statement, then A[?] is a descendant of v and 
if the main while loop got broken due to the second break statement, then A [t] is an ancestor of u. Since we 
interleave adding a vertex to the set ANC with adding a vertex to the set DES, if we break because of the 
first break statement, we have |ANC| = |DES|; and if we break because of the second break statement, we 
have |ANC| = |DES| + 1. Thus in both cases, it holds that the number of descendants of v in A[j..?] is equal 
to the number of ancestors oiu\aA[{t + \)..j\. I 

At the end of Phase 1, since G is still a DAG, we delete all the elements of ANC U DES from their 
current locations in A. The vertices in ANC have to find new places in A[/..f] and the vertices in DES have 
to find new places in A[(? + This forms Phase 2 of our algorithm. 

Phase 2. We now describe Phase 2 from the side of ANC as Algorithm 2.2. (A symmetric procedure 
also takes place on the side of DES.) In this phase vertices get deleted from ANC and vertices can also get 
added to ANC. 

Algorithm 2.2 Phase 2 of our algorithm from the side of ANC. 

LeftPtr = f; 
while LeftPtr > / do 
if A[LeftPtr] is an empty location then 

- delete the head of the queue ANC, call it h, and set A[LeftPtr] = h 
else if A [LeftPtr] (call it r) is a predecessor of some element in ANC then 

- insert r to the queue ANC 

- delete the head of the queue ANC, call it h, and set A [LeftPtr] = h 
end if 

LeftPtr = LeftPtr - 1; 
end while 



In Phase 2, the pointer LeftPtr traverses the subarray A[t..i] (from t leftwards to and elements get 

deleted/inserted in ANC. Whenever LeftPtr sees an empty location in A, the head of the queue ANC is 
deleted from ANC and is assigned to that empty location. Whenever A [LeftPtr] is a predecessor of some 

■^Note that the number of empty locations in A[i..t\ exactly equals |ANC| and the number of empty locations in A[{t + l)..y] 
exactly equals | DES| . 
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element in ANC, then A[LeftPtr] is removed from that location and is inserted into ANC and the current 
head of the queue ANC is inserted into that location. We have the following lemma, which is simple to 
show. Its proof is included in the Appendix. 

Lemma 3 The subroutine in Algorithm 2.2 maintains the following invariant in every iteration of the while 
loop: the number of elements in ANC equals the number of empty locations in the subarray A[i..V.e^tPtr\. 

Proof: It is easy to see that the invariant is true at the beginning of the subroutine. In other words, at the 
end of Phase 1, the number of empty locations in A[/..f] exactly equals |ANC|. 

We will now show that this invariant is maintained throughout Phase 2. Whenever LeftPtr sees an 
empty location in A, we delete an element from ANC, hence this invariant is maintained since the number 
of empty locations in A [/..LeftPtr] decreases by one and so does the size of ANC. Whenever LeftPtr sees 
a predecessor p of some element of ANC at the current location, then p is deleted from its current location 
LeftPtr = ^ in A and p is inserted into ANC; the leading element of ANC gets deleted from ANC and we 
assign A[l] = h. Thus neither the number of empty locations in A[/.. LeftPtr] nor the size of ANC changes 
by our deletion and insertion, so the invariant is maintained. Hence when we exit the while loop, which is 
when LeftPtr = / — 1, the queue ANC will be empty. I 

We then traverse the subarray A[(? + from location ? + 1 to location j and execute the algorithm in 
Algorithm 2.2 with respect to DES. For the sake of completeness, we present that algorithm as Algorithm 2.3 

below. 

Algorithm 2.3 Phase 2 of our algorithm from the side of DES. 

RightPtr = ? + l; 
while RightPtr < ; do 
if A[RightPtr] is an empty location then 

- delete the head of the queue DES, call it I, and set A [RightPtr] = £ 
else if A[RightPtr] (call it r) is a successor of some element in DES then 

- insert r to the queue DES 

- delete the head of the queue DES, call it i, and set A [RightPtr] = t 
end if 

RightPtr = RightPtr+ I; 
end while 



A lemma analogous to Lemma 3 will show that there is always enough room in the array A [RightPtr.. y] 
to accommodate the elements of DES (refer Algorithm 2.3). This completes the description of our algorithm 
to update the topological ordering when a new edge is inserted. 

2.1.2 Correctness. 

We would now like to claim that after running Phase 1 and Phase 2 of our algorithm, we have a valid 
topological ordering. Our ordering is defined in terms of the array A. Each element x that has been assigned 
a new location in A has a new ord value, which is the index of its new location. For elements that never 
belonged to ANC U DES, the ord value is unchanged. For the sake of clarity, let us call the ordering before 
the new edge (m,v) was inserted as ord and let ord' denote the new function after executing our algorithm. 
We will show the following theorem here (its proof is included in the Appendix). 

Theorem 3 The function ord' is a valid topological ordering. 

Proof: We need to show that ord' is a valid topological ordering. Consider any edge {x,y) in the graph. We 
will show that ord'{x) < ord'(j). We will split this into three cases. 
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• ;c G ANC. There are three further cases: (i) y G ANC, (ii) j G DES, (iii) y is neither in ANC nor in 
DES. In case (i), both x and y are in ANC and since there is an edge from x to y, the vertex y is ahead 
of X in the queue ANC. Soy gets deleted from ANC before x and is hence assigned a higher indexed 
location in A than x. In other words, ord'(;t) < ord'(y). In case (ii), we have ord'(x) <t < ord'(y). 
In case (iii), since elements of ANC move to lower indexed locations in A, we have ord'(x) < ord(x) 
whereas ord'(y) = ord(y); since ord(x) < ord(y), we get ord'(x) < ord'(y). 

• X G DES. There are only two cases here: (i) y G DES or (ii) y is neither in ANC nor in DES. This 
is because if x G DES and y G ANC, then y m — > v ^ x. So (x,y) G E creates a cycle. This is 
impossible since we assumed that after inserting the edge (m,v), G remains a DAG. Thus we cannot 

have X G DES and y G ANC for (x,y) G E. 

In case (i) here, because (x,y) G E, the vertex x is ahead of the vertex y in the queue DES, so x gets 
deleted from DES before y and is hence assigned a lower indexed location in A than y. Equivalently, 
ord' (x) < ord'{y) . In case (ii) here, since x G D ES and y ^ D ES, it has to be that either ord (y) > ord (w) 
in which case ord'(x) < ord(>') = ord'(y), or by the time RightPtr visits the location in A containing y, 
the vertex x is already deleted from the queue DES - otherwise, y would have been inserted into DES. 
Thus ord'(x) < ord(_y) = ord'(_y). 

• X ^ ANC U DES. There are three cases again here: (i) y G ANC, (ii) y G DES, (iii) y is neither in ANC 
nor in DES. The arguments here are similar to the earher arguments and it is easy to check that in all 
three cases we have ord'(x) < ord'(j). 

Thus ord' is a valid topological ordering. I 

Thus our algorithm is correct. In Section 3 we will show that its running time, summed over all edge 
insertions, is 0{n^l^). 

3 Running Time Analysis 

The main tasks in our algorithm to update ord to ord' (refer to Algorithms 2.1, 2.2, and 2.3) are: 

(1) moving a pointer LeftPtr from location j to i in the array A and checking if A[LeftPtr] is a predecessor 
of any element of ANC; 

(2) moving a pointer RightPtr from location / to j in the array A and checking if A [RightPtr] is a successor 
of any element of DES; 

(3) checking at the end of Phase 1, if (x,y) G E for any x in {x : ? < ord(x) < t and v x} and any y in 
{y:t< ord(y) < j and y u}. (If so, then (m,v) creates a cycle.) 

Lemma 4 bounds the cost taken by Step 3 over all iterations. It can be proved using a potential function 
argument. 

Lemma 4 The cost for task 3, stated above, summed over all edge insertions, is 0{n^). 

Proof: We need to check if there is an edge (x,y) between some x G DES and some y G ANC. We pay a 

cost of |ANC| • |DES| for checking |ANC| • |DES| many entries of the matrix M. 

Case(i): There is no adjacent pair (x,^) for x G DESjj G ANC. Then the cost |DES| • |ANC| can be 
bounded by N{e), which is the number of pairs of vertices {y,x) for which the relationship j -w x has started 
now for the first time (due to the insertion of the edge (m,v)). Recall that at the end of Phase 1, DES is the 
set {x : / < ord(x) < t and v x} and ANC is the set {y : i < ord(>') < t and y u}. Thus each vertex in 
this set DES currently has a lower ord value than each vertex in ANC - so the only relationship that could 
have existed between such an x and yisx-^y, which we have ensured does not exist. Thus these pairs {x,y) 
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were incomparable and now the relationship y-^x has been established. It is easy to see that Eeg£ A^(e) is 
at most (2) since any pair of vertices can contribute at most 1 to LeG£^(^)- 

Case(ii): There is indeed an adjacent pair {x,y) for x G DES,_y G ANC. Then we quit, since G is no 
longer a DAG. The check that showed G to contain a cycle cost us |ANC| • | DES|, which is O(n^). We pay 
this cost only once as this is the last step of the algorithm. I 

Let ordf, be our valid topological ordering before inserting edge e and let ordg be our valid topological 
ordering after inserting e. Lemma 5 is our first step in bounding the cost taken for tasks 1 and 2 stated above. 

Lemma 5 The cost taken for tasks 1 and 2, stated above, is Y^xev |orde(x) — ordg(x)|. 

Proof: In Step 1 we find out if A[LeftPtr] = x is a predecessor of any element of ANC by checking the 
entries M[x,w] for each w G ANC. Each element w which is currently in ANC pays unit cost for checking 
the entry M[x,w]. 

Any element w G ANC belongs to the set ANC while LeftPtr moves from location orde(w) to ordg(w). 
When LeftPtr = orde(H') and we identify A[orde(H')] = w to be a predecessor of some element in ANC, the 
vertex w gets inserted into ANC. When LeftPtr is at some empty location P and the vertex w is the head 
of the queue ANC, then w is deleted from ANC and we set A[P] = w, which implies that ordg(w) = p. So 
the total cost paid by w is orde(>v) — ordg(w), which is to check the entries M[A [LeftPtr], w] as the pointer 
LeftPtr moves from location ordg(H') — 1 to ord',(i4''). 

Symmetrically, for any vertex y that belonged to DES during the course of the algorithm, the cost paid 
by y is ordg(_y) — orde(_y). A vertex z that never belonged to ANCU DES, does not pay anything and we 
have ordg(z) = orde(z). Thus for each x G V, the cost paid by x to move the pointers LeftPtr/RightPtr is 
|orde(x)-ordg(x)|. I 

We will show the following result in Section 3.1. 

Lemma 6 LeesLArGV '■5' 0{rv'l'^), where ordg is our valid topological ordering before 

inserting edge e and ord^ is our valid topological ordering after inserting e. 

Theorem 1, stated in Section 1, follows from Theorem 3, Lemmas 4 and 6. Also note that the space 
requirement of our algorithm is 0{n^), since our algorithm uses only the n x n adjacency matrix M, the 
array A, the queues ANC, DES, and the pointers LeftPtr, RightPtr. 

3.1 Proof of Lemma 6 

Let e = (m, v) and let ordg be our topological ordering before inserting (w, v) and ord^ our topological ordering 
after inserting (m,v). Our algorithm for updating ordg to ord^ basically permutes the vertices in the subarray 
A [i..f\ . The elements which get inserted into the queue ANC move to lower indexed locations in A (compared 
to their locations in A before e was added), elements which get inserted into the queue DES move to higher 
indexed locations in A, and elements which did not get inserted into either ANC or DES remain unmoved in 
A. So our algorithm is essentially a permutation 7lg of elements that belonged to ANCU DES. 

Let ANCg denote the ordered set of all vertices that get inserted to the set ANC in Algorithms 2.1 and 

2.2 (and of course, later get deleted from ANC in Algorithm 2.2) while inserting the edge e. In other 
words, these are the vertices w for which orde(w) > ordg(M'). Define DESg as the ordered set of all those 
vertices w for which ordi;{w) < ordg(w). Equivalently, these are all the vertices that get inserted into DES in 
Algorithms 2.1 and 2.3. Let ANCg = {mo,mi, . . .,Uk}, where uq = u and ord(Mo) > ord(Mi) > • • • > ord(M;fe), 
and let DESg = {vq, vi, . . . , v^}, where vq = v, and ord(vo) < ord(vi) < • • • < ord(Vi). 

Let us assume that all the vertices of ANCg U DES^ are in their old locations in A (those locations given 
by the ordering ordg; so A[/] = v and A[/] = u). We will now decompose the permutation Tig on these 
elements into a composition of swaps. Note that our algorithm does not perform any swaps, however to 
prove Lemma 6, it is useful to view lig as a composition of appropriate swaps. The function swap(x, j) takes 
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as input: x G ANCg and y G DES^, where ord(x) > ord(y), and swaps the occurrences of x and y in the array 
A. That is, if ^[/z] = x and = y, where h> I, then swap(x, j) makes A [^] = x and A[/i] = y. 

The intuition behind decomposing %e into swaps between such an element x G ANCg and such an element 
y G DESg is that we will have the following useful property: whenever we swap two elements x and y, it is 
always the case that ord(x) > ord(j) and we will never swap x and y again in the future (while inserting 
other new edges) since we now have the relationship x-^y, so ord(>') > ord(x) has to hold from now on. 

We will use the symbol ord to indicate the dynamic inverse function of A, so that ord reflects instantly 
changes made in the array A. So as soon as we swap x and y so that A[£] = x and A[/z] = y, we will say 
ord(x) = i and ord(y) = h. Thus the function ord gets initiahzed to the function ordg, it gets updated with 
every swap that we perform and finally becomes the function ord^. 

3.1.1 3.1.1 Decomposing 71^ into appropriate swaps. 

- Initialize the permutation Tig to identity and the function ord to ordg. 

for X G {uk,uic-i , . . . , Mo} (this is ANC^: elements in reverse order of insertion into ANCg) do 
for y e {vs,Vs-\,..., vo} (this is DESg: elements in reverse order of insertion into DESg) do 
if ord (x) > ord(y) then 

Tie = swap(x, o Jig (*) {Note that swapping x and y changes their ord values.} 
end if 
end for 
end for 

- Return 71^ (as a composition of appropriate swaps). 

To get a better insight into this decomposition of He, let us take the example of the element u^ G ANC 
(uk has the minimum ordg value among all the elements in ANC). Let vq, vi , . . . , be the elements of DESg 
whose orde value is less than ordg(M^:) = a. Recall that orde(vo) < • • • < orde(vr) < orde(vr+i) < • • •orde(v.v). 
When the outer for loop for x = ut is executed, u^ does not swap with v.,, . . . , v,.+i. The first element that u^ 
swaps with is Vr, then it swaps with Vr-i, so on, and Uk finally swaps with vq and takes the location / in A that 
was occupied by vq. Thus ord^(Mfe) = i and ord(vo), . . . ,ord(vr) are higher than what they were formerly, 
since each G {vq, . . . , v^-i} is currently occupying the location that was formerly occupied by v^+i, and Vr 
is occupying u^'s old location a. Thus the total movement of from location a to location i, can be written 
as: 

orde{uk)-ord'^{uk) = a-i= d{uk,V() 

V<€{Vr,...,Vo} 

where d{uk,V() = ord{uk) — ord(v^) when swap(M;t, v^) is included in 71^ (refer to (*) in Section 3.1.1). 

Correctness of our decomposition of 7i^. It is easy to see that the composition of swaps. Tig, that is 
returned by the above method in Section 3.1.1, when applied on ANQ = {uk,Uk-i,...,uo} and DES^ = 
{vo, . . . ,Vs-i,Vs}, results in these elements occurring in the relative order: Uk,uic-i,. . . , mo,vo, . . . ,Vs-\,Vs in 
A. We claim that our algorithm (Algorithms 2.1, 2.2, 2.3) places these elements in the same order in A. This 
is because our algorithm maintained both ANC and DES as queues - so elements of ANCg (similarly, DESg) 
do not cross each other, so m^, . . . ,mo (resp., vq, . . . , v^) will be placed in this order, from left to right, in A. 
Also, we insert all elements of ANCg in A[j..?] and all elements of DESg in A[(? + l)..y], thus our algorithm 
puts elements of ANCgU DESg in the order Uk,Uk-\,. . . , M(),vo, . . . ,Vs_i,Vs in A. Thus we have obtained a 
correct decomposition (into swaps) of the permutation performed by our algorithm. 

For every pair (x,j) G ANCg x DESg, if swap(x,j) is included in Tig (see (*)), define d{x,y) = ord(x) — 
ord(y), the difference in the location indices occupied by x and y, when swap(x,y) gets included in Tig. For 
instance, d{uk,vo) = orde(vi) — orde(vo) since Uk is in the location orde(vi) (due to swaps with Vr, . . . ,vi) 
and Vo is unmoved in its original location orde(vo), at the instant when swap(M;t) vo) gets included in Ttg. 
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Since we broke the total movement in A of any x G ANCg (which is orde(x) — ord'g{x)) into a sequence 
of swaps with certain elements in DESg, we have for any x G ANCg 

ordg(x)-ord^(x) = ^(^'3')' 

y:(x,y)ene 

where we are using "{x,y) G Ttg" to stand for "swap(x,y) exists in Ttg". 

Wehave ^|ord,W-ord;(x)| = ^ (ord,(w) - ord;(w)) + ^ (ord^lj) - ord,(y)) (1) 
xev weANQ yeDESe 

= 2 £ (ord,(x)-ord^(x)) (2) 

xgANC,. 

= 2 £ £ (3) 

^€ANCe y:{x,y)ene 

= 2 £ d{x,y). (4) 

(x,y):(x,y)e7Ce 

Equality (2) follows from (1) because Y,xev °'''^{^) = Hxev^^'^'i^)- Equality (3) follows from the preceding 
paragraph. So the entire running time to insert all edges in E is 2Y,g^EY,(x,yy.(x,y)e%e ^i^iy)- 

We now claim that for any pair {x,y), we can have {x,y) G Tig for at most one permutation Ttg. For 
swap(x,y) to exist in Tig, we need (i) {x,y) G ANCg x DESg, and (ii) ord(x) > ord(y). Once Tie swaps x 
and y, subsequently x y (since x --^ u ^ v y) and ord(y) > ord(x), so {x,y) can never again satisfy 
ord(x) > ord(y). So for any pair {x,y), swap(x,>') can occur in at most one permutation Jig over all e G -E. 
Thus we have: 

£ £ d{x,y)= £ d{x,y). (5) 

e£E (^x,y):{x,y)£TCg {x,y):{x,y)£Kg for some e 

Note that the summation on the right hand side in Inequality (5) is over all those pairs (x,y) EV xV such 
that swap(x, j) exists in some Jig, for e G 

The following lemma was shown in [1]^. This finishes the proof of Lemma 6. 

Lemma 7 £(i(x,>') is Oijr'l^), where the summation is over all those pairs (x,y) such that swap(x,y) exists 
in some permutation Jle,e G 

Proof: We present the proof of this lemma given in [1]. We need to show that Y,x,yd(x,y) is 0(n^/^). Let 
ord* denote the final topological ordering. Define 



X(ord*(x),ord*(y)) 



d(x,y) if there is some permutation Ttg that swaps x and y 
otherwise. 



Since swap(x,y) can occur in at most one permutation 71^, the variable X{i,j) is clearly defined. Next, we 
model a few linear constraints on X{i,j), formulate it as a linear program and use this LP to prove that 
max{£,jX(/, j)} = 0(n^/^). By definition of d{x,y) andX{i,j), 

< X{i,j) < n, for all ij G {1 . . .«}. 

For j < i, the corresponding edges (ord*^'(/),ord*^^(j)) go backwards and thus are never inserted at all. 
Consequently, 

= Oforall j<i. 

Now consider an arbitrary vertex w, which is finally at position /, i.e., ord*(w) = /. Over the insertion of 
all the edges, this vertex has been moved left and right via swapping with several other vertices. Strictly 



^The algorithm in [1] performs swaps to obtain a valid topological ordering and Lemma 7 is used in their analysis to show an 
0{rp-''^) upper bound for the running time of their algorithm. 
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speaking, it has been swapped left with vertices at final positions j > i and has been swapped right with 
vertices at final position j < i. Hence, the overall movement to the left is Y,j>i^{hj) to the right is 
ILjKi^Uj 0- Since the net movement (difference between the final and the initial position) must be less than 
n, 

Y^^iiJ) - Y^^U, i) < n for all l<i<n. 
Putting all the constraints together, we aim to solve the following hnear program. 

max ^ such that 

l<i<n, \<j<n 

(i) X{i,j) = for all 1 < j < n and 1 < j < i 

(ii) < j) < n for all 1 < ? < n and i < j <n 

(iii) Lj>i^ {h j) - Lj<iXU, i)<n-l, for all 1 < ? < n 

In order to prove the upper bound on the solution to this LP, we consider the dual problem: 



mm 



0<i<n, i< j<n 0<i<n 



such that 



(i) Yj.n^j > 1 for all < j < n and for all j < i 

(ii) Yi-n+ j + Yn^+i — Y„2_^ ; ^ 1 for all < ? < « and for all j > i 

(iii) Yi>Q for all < / < + „ 

and the following feasible solution for the dual: 

Yi.„j^j = 1 for all < j < n and for all 0< j <i 

Yi.n+j = 1 for all < j < n and for all i< j <i + \ +2^/n 

Yi.n+j = for all < j < n and for all j>i+\ + 2 Vn 



Yrp-+i 



= Vn — i for all < ? < n. 



The solution has a value of rp' + 2n^/-^ + nE/Li = 0(n^/'^), which by the primal-dual theorem is a bound 
on the solution of the original LP. This completes the proof of Lemma 7 and thus Lemma 6 is proved. I 



4 The 0{{m + n log n) ^/m) algorithm 

In this section we present an algorithm with running time 0{{m + nlogn)Y^) for online topological order- 
ing. This algorithm is an adaptation of the algorithm by Katriel and Bodlaender in [6] and uses the Ordered 
List data structure from [5], also used in [6] for this problem. That is, the function ord on V is maintained by 
a data structure ORD which is a data structure that allows a total order to be maintained over a list of items. 
Each item x in ORD has an associated integer label ord {x) and the label associated with x is smaller than 
the label associated with y, iff x precedes y in the total order. The following operations can be performed in 
constant amortized time [see Dietz and Sleator [5], Bender et al. [3] for details]: the query Order{x,y) de- 
termines whether X precedes y or y precedes x in the total order (i.e., if ord(x) < ord(y) or ord(y) < ord(x)), 
InsertAfter{x, y) (InsertBefore {x,y)) inserts the item x immediately after (before) the item y in the total order, 
and Delete (x) removes the item x. 

When a new edge (m,v) is added to a graph G, there are two cases: (i) either Order{u,v) is true, in which 
case the current ordering of elements in ORD is still a valid ordering, so we need to do nothing except add 



11 



(m,v) in the list of edges incoming into u and in the list of edges going out of v; (ii) Order{u,v) is false, in 
which case the edge (m,v) is invalidating and we need to change the order of vertices in ORD. 

Our algorithm to insert an invalidating edge (m,v) performs various steps. Each step involves visiting an 
ancestor of u and/or visiting a descendant of v. 

• Initially u is the only ancestor of u that we know. So we visit u. We use a Fibonacci heap to store 
ancestors of u that we have seen but not yet visited. For an ancestor x of u, visit(x) means that for every edge 
{w,x) into X, we check if w is already present in F„ and if w is not present in F^, we insert w into F^. 

• The next ancestor of u that we visit is the vertex with the maximum ORD label in Fj,. An extract-max 
operation on this F-heap (the priority of vertices in Fj, is determined by how high their associated label is in 
ORD) determines this vertex x. 

• Analogously, we have a Fibonacci heap Fy to store descendants of v that we have seen but not yet 
visited. For any descendant y of v, visit(y) means that for every edge {y,z) out of y, we check if z is already 
present in the F-heap Fy and if z is not present in Fy, we insert z into Fy. The priority of vertices in Fy 
is determined by how low their associated label is in ORD. Thus an extract-min operation on this F-heap 
determines the next descendant of v that we visit. 

• At the end of each step we check if Order{x,y) is true, where x is the last extracted vertex from F„ 
and y is the last extracted vertex from F^. If Order{x,y) is true (i.e, if x precedes y in ORD), then this is 
the termination step; all the ancestors of u that we visited, call them {mo(= m), . . . ,Uk} and the descendants 
of V that we visited, call them {vo(= v),... ,Vs}, get reinserted in ORD after x or before y, in the order 
uic,...,uo,vo,...,Vs. Else, i.e, if y precedes x in ORD, then we delete x and y from their current positions in 
ORD and in the next step we either visit x or y or both x and y. 

In any step of the algorithm, if {mq, wi , . . . , Mr} is the set of ancestors of u that we have already visited 
(in this order, so ord(Mr) < ••• < ord(Mo)) in the previous steps, then the ancestor of u that we plan to 
visit next is the vertex x with the maximum ORD label that has an edge into a vertex in {mo,mi, . . . ,Ur}. 
Once we visit x, we would have visited all ancestors of u with ORD labels sandwiched between ord (x) and 
ord(M). Similarly, on the side of v, if vo,vi, . . . are the descendants of v that we have already visited (i.e., 
ord (vo) < • • • < ord (v^ )), then the descendant of v that we plan to visit next is the vertex y with the minimum 
ORD label which has an edge coming from one of {vq, vi , . . . , Vf }. 

When Order {x,y) is true, it means that we have discovered all descendants of v with ORD label values 
between ord (v) and i, and all ancestors of u with ORD label values between i and ord (w) (where i is any value 
such that ord{x) < i < ord(y)). Thus we can relocate vertices i/yt, . . . ,mo,vo, . . . ,Vv (in this order) between x 
and >'. It is easy to see that now for every {a,b) G E, we have that a precedes h in ORD. 

What remains to be explained is how to make the choice between the following 3 options in each step: 
(i) visit(x) and visit(y), (ii) only visit(x), or (iii) only visit(y). 

Visit(x) and/or Visit(y). In order to make the choice between visit(x) and/or visit(y), let us make the 
following definitions: Let ANC denote the set of ancestors of u that we have already visited plus the ancestor 
X that we plan to visit next. Let DES denote the set of descendants of v that we have already visited plus the 
descendant y that we plan to visit next. Let mo be the sum of out-degrees of vertices in DES and let be 
the sum of in-degrees of vertices in ANC. 

If we were to visit x in the current step, then the total work done by us on the side of u so far would be 
mA + I ANC| log« (to have examined ma edges incoming into ANC and for at most mA insertions in F„, and 
to have performed | ANC| many extract-max operations on F^). Similarly, if we were to visit y in the current 
step, then the total work done by us on the side of v so far would be mo + 1 DES| logn. 

Definition! 7jfm>i < mo < mA + |ANC| logn or mD<mA<mD + \DES\lognthenwesaythatmAandmD 
are "balanced" with respect to each other. Else we say that they are not balanced with respect to each other 

If Ma and md are balanced with respect to each other, then we visit both x and y. Else if ma < mo, then 
we visit only x, otherwise we visit only y. This is the difference between our algorithm and the algorithm 
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in [6] - in the latter algorithm, either only x is visited or only y is visited unless mo = ntA, in which case 
both X and y are visited. In our algorithm we are ready to visit both x and y more often, that is, whenever 
Ma and mo are "more or less" equal to each other. If we visit both x and y, then the total work done is 
mA + 1 A N C I log n+mD + |DES| log n. We can give a good upper bound for this quantity using the fact that niA 
and rtiD are balanced with respect to each other. On the other hand, if ma and mj) are not balanced w.r.t. each 
other, we are not able to give a good upper bound for the sum of (m^ + | ANC| logn) and {mo + | DES| logn), 
hence we visit either x or y, depending upon the smaller value in {mA,mD}- 

4.1 The algorithm 

Our entire algorithm to reorder vertices in ORD upon the insertion of an invalidating edge (m, v) is described 
as Algorithm 4.1. This algorithm is basically an implementation of what was described in the previous 
section with a check at the beginning of every step to see if and nij} are balanced with respect to each 
other or not. If they are, then we visit both x and y. Else, we visit only one of them {x if niA < mo, else y). 
The algorithm maintains the invariant that the ORD labels of all elements in ANC are higher than the ORD 
labels of all elements in DES. The termination condition is determined by Order {x^y) being true, where x 
is the last extracted vertex from F„ and y is the last extracted vertex from Fy. 

For simphcity, in the description of the algorithm we assumed that the heaps F„ and Fy remain non- 
empty (otherwise extract-max/extract-min operations would return null values) - handling these cases is 
easy. We also assumed that the edges inserted are the edges of a DAG. Hence we did not perform any cycle 
detection here. (Cycle detection can be easily incorporated, by using 2 flags for each vertex that indicate its 
membership in F„ and in Fy.) When our algorithm terminates, it is easy to see the order of vertices in ORD 
is a valid topological ordering. We discuss the running time of Algorithm 4. 1 in the next section. 

4.1.1 The running time 

Let T{e) denote the time taken by Algorithm 4.1 to insert an edge e. We need to show an upper bound 

for where the sum is over all invalidating edges e. For simplicity of exposition, let us define the 

following modes. While inserting an edge (m,v), if a step of our algorithm involved visiting an ancestor of u 
and a descendant of v, we say that step was performed in mode (i). That is, at the beginning of that step, we 
had niA and mo balanced with respect to each other. If a step involved visiting only an ancestor of u, then 
we say that the step was performed in mode (ii), else we say that the step was performed in mode (Hi). 

We partition the sum Y.e'^i^) i^ito 2 parts depending upon the mode of the termination step of our 
algorithm. Let = Y,eT{e) be the time taken by our algorithm over all those edges e such that the ter- 
mination step was performed in mode (i). Let 52 = Ee^(^) where the sum is over all those edges e such 
that the termination step was performed in mode (ii) or mode (iii). We will show that both 5i and 52 are 
0{{m + n\ogn)^/m). These bounds on S\ and 52 will prove Theorem 2 stated in Section 1. 

The following lemma shows the bound on 5i. We then show an analogous bound on 52. 

Lemma 8 5i is 0{{m + nlogn)^/m). 

Proof: Let us consider any particular edge e; = (m, v) such that the last step of Algorithm 4. 1 while inserting 
ei was performed in mode (i). So the termination step involved visiting an ancestor Uk of u, extracting the 
next ancestor x of u, visiting a descendant of v, extracting the next descendant y of v and then checking 
that X precedes y in ORD. 

Let the set ANC = {m, Ml , ... , Uk} and the set DES = {v, vi , . . . , }. Let be the sum of in-degrees 
of vertices in ANC and let m^ be the sum of out-degrees of vertices in DES. During all the steps of the 
algorithm, we extracted | ANC| many vertices (the vertices u\ Uk and x) from Fu and | DES| many vertices 
fromFy. So we have r(e,) is 0{mA + |ANC|Iogn-|-m£)-|- |DES|Iogn). 

Since the termination step was performed in mode (i), we have that niA < mj) <mA + |ANC|logn or 
mo <mA< niD + | DES| logn. Without loss of generality let us assume that ruA < mo < + | ANC| logn. 
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Algorithm 4.1 Our algorithm to reorder vertices in ORD upon insertion of an invalidating edge (m,v). 



Initially, ANC = {u] and DES = {v}. 

Set x = u and y = v. Delete x and y from their current locations in ORD. 
Set MA = m's in-degree and mo = v's out-degree, 
while TRUE do 

if niA and mo are balanced w.r.t. each other (see Defn. 1) then 

Visit(x) and Visit(y). 

{So new vertices get inserted into F„ and into Fy.} 
X =extract-max(Fu) 
y =extract-min{Fy) 
if ord(x) < ord(y) then 

- insert all elements of ANC (with the same relative order within themselves) followed by all 
elements of DES (with the same relative order) after x in ORD 

break {This terminates the while loop} 
else 

Delete x and y from their current positions in ORD. 

ANC = ANCu{;c} and DES = DES U {y} 

niA = niA +x's in-degree and mo = mo + y's out-degree 
end if 
else ifmA <rnD then 
Visit(x) 

X =extract-max{Fu) 
if ord(x) < ord(j) then 

- insert all elements of ANC followed by all elements of DES after x in ORD. 
break 

else 

Delete x from its current position in ORD. 
Set ANC = ANCU {;<:} and ma = mA +Jc's in-degree. 
end if 
else 

Yisiiiy) 

Let y =extract-max{Fy) 
if ord(x) < ord(j) then 

- insert all elements of ANC followed by all elements of DES before y in ORD. 
break 

else 

Delete y from its current position in ORD. 
Set DES = DES U {y} and mo = mo + y's out-degree, 
end if 
end if 
end while 



Hence r(e, ) can be upper bounded by some constant times 

mA-F I ANC|logn-F I DES|logn. (6) 

Let us assume that jANC] > |DES|. (Note that the case |ANC| < |DES| is symmetric to this and the 
case |ANC| = |DES| is the easiest.) Since the termination step was performed in mode (i), for |ANC| to be 
larger than |DES|, it must be the case that at some point in the past, our algorithm to insert e, was operating 
in mode (ii) and that contributed to accumulating quite a few ancestors of u. Let step t be the last step 
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that was operated in mode (ii). So at the beginning of step t we had + | ANC'| Iog?i < m^, where ANC' 
was the set of ancestors of u extracted from the F-heap F„ till the beginning of step t and m'^ is the sum 
of in-degrees of vertices in ANC', and is the sum of out-degrees of vertices in DES' where DES' was 
the set of descendants of v extracted from the F-heap till the beginning of step t. After step t, we never 
operated our algorithm in mode (ii). Thus subsequent to step t whenever we extracted a vertex from F^, we 
also extracted a corresponding vertex from Fy. So we have |ANC| < |ANC'| + |DES|. Using this inequality 
in (6), we get that 

T{ei) < cipiA + |DES|logn-|- |ANC'|logn), for some constant c. 
Claim 1 We have the following relations: 

where is the number of pairs of edges {e,e')for which the relationship e e' has started now 
for the first time due to the insertion ofei. [We say {a,b) {c,d) ifb is an ancestor ofc] 

• |DES|2 < |ANC|-|DES| <N{ei) 

where N{ei) is the number of pairs of vertices (w,w') such that w -^w' has started now for the first 
time due to the insertion ofei. [We say w -^W ifw is an ancestor ofw'.] 

• (|ANC'|logn)2 < (|ANC'|log«)-mi) <»P(e,)log« 

where ^{ci) is the number of pairs (w,e) x E for which the relationship w e has started now 
for the first time. [We say w {a, b) ifw is an ancestor of a.] 

Proof of Claim 1. After the insertion of edge (m,v) we have e e' for every edge e incoming into ANC = 
{m, Ml , . . . , Uk} and every edge e' outgoing from DES = {v, vi , . . . , v^}. Prior to inserting e,-, the sink of each 
of the mA edges incoming into ANC has a higher ORD label compared to the source of each of the mo edges 
outgoing from DES - thus we could have had no relation of the form e e' between the edges incoming 
into ANC and the mo edges outgoing from DES. So <I>(e,) > mAmo- 

The above argument also shows that N{ei) > |ANC| • |DES|. We have |ANC'|m^ < ^(e,) because the 
source of each of these edges had a lower ORD label than the vertices in ANC' prior to inserting e, ; thus 
the relation w e' for each w G ANC' and the edges e' (m'j) many of them) outgoing from DES' is being 
formed for the first time now. I 

Now we are ready to finish the proof of Lemma 8. Corresponding to the insertion of each edge ej whose 
termination step was in mode (i), the work done by our algorithm is at most c{fj+gj + hf) where /? < ^{ej) 
and g^j < N{ej)\o^n and ft^ < ^{ej)\ogn. In order to bound T,j{fj + 8j + hj), we use Cauchy's inequality 

which states that Y!iLi Xi < \JYa^V^^ for xi , . . . ,Xm G M. This yields 

l.fj+Sj + h, < (V^+v^+v^)v^ (7) 

j 

< {m + n\ogn + \/nm\ogn)^/m. (9) 

We have Y,j^{ej) is at most (2) since each pair of edges e and e' can contribute at most 1 to 
similarly 'T.j^i.^j) is at most mn, and YljN{ej) is at most (2). This yields Inequahty (9) from (8). Since 
^/mn\ogn < (m + nlogn)/2, this completes the proof that the sum 5i is 0((m-|-nlogn)-y/m). | 

Analogous to Lemma 8, we need to show the following lemma in order to bound the running time of 
Algorithm 4.1 by 0{(m + nlogn)^/m). 



15 



Lemma 9 52 is 0{{m + n\ogn)^/m). 

Proof: Recall that ^2 = T.T{e) where the sum is over all those e such that the termination step of Algo- 
rithm 4.1 was performed in mode (ii) or mode (iii). Let us further partition this sum into £gr(e) over all 
those e for which the last step was performed in mode (ii) and Y,e' ^(^') over all those e' for which the last 
step was performed in mode (iii). The analysis for the second sum will be entirely symmetric to the first. 
We will now bound the first sum. 

Let = (m,v) be an edge such that the termination step of our algorithm was performed in mode (ii). 
Let the set ANC = {u,ui,.. .,Uk} and let the set DES = {v,vi, . . . ,Vs}. Let ma be the sum of in-degrees of 
vertices in ANC and let mo be the sum of out-degrees of vertices in DES. Since the termination step was 
performed in mode (ii), we have + |ANC| log« < m^). 

The work that we did in all the steps while inserting (m,v) from the side of the vertex m is + 
|ANC|logn. Let step / be the last step of our algorithm which was operated in mode (i) or in mode (iii). 
If there was no such step, then the total work done is at most + |ANC| log« and it is easy to bound this 
using the inequality ma + |ANC| log?i < nio- Hence, let us assume that such a step t did exist and let DES' 
be the set of descendants of v at the beginning of step t and let m'^ be the sum of out-degrees of vertices in 
DES'. Note that we have < -I- | ANC'| logn since this step was operated in mode (i) or in mode (iii). 

The total work done from the side of v is + |DES'|logn. Thus the total work r(e,) is 0{mA + 
|ANC|logn-|-m(,-|- |DES'|logn). Using the inequality < m^-|- |ANC'|logn < mA + |ANC| logn, we have 
T{ei) upper bounded by a constant times 

mA + |ANC| logn I DES' I logn. (10) 

Let us concentrate on the last term | DES'| logn in the above sum. Let DES" be the set of descendants of 
V at the beginning of the last step when we ran in mode (iii). So m'^ + |DES"| logn < m'^, where is the 
sum of the out-degrees of vertices in DES", ANC" is the set of ancestors of u at the beginning of this step 
and is the sum of the in-degrees of vertices in ANC". After this step, whenever we explored edges on the 
side of V, it was in mode (i), thus visiting a descendant of v was always accompanied by visiting an ancestor 
of u. So I DES' I < I DES" I + |ANC|. Substituting this bound in (10) we get that 

T{ei) < c{mA + |ANC|logn-|- |DES"|logn) for some constant c. 

We have the following relations (see Claim 1 for the definitions of and ^{ei)): 

• (mA + |ANC|logn)2 < (wta + |ANC| log?i)mD 

<^{ei)+^>{ei)\ogn. 

• (|DES"|logn)2 < (|DES"|logn)m;[ 

< v(£', )log« 

where v(e,) is the number of pairs {e,z) & E xV that get ordered with respect to each other for the 
first time now due to the insertion of e, . 

The proofs of the above relations are analogous to the proofs given in Claim 1 and we refer the reader 
to the proof of Claim 1 (in Section 4.1.1). 

We are now ready to complete the proof of Lemma 9. ^^(e, ) where the sum is over all those whose 
last step was performed in mode (ii) is at most £, {pi + qi ) where < <I> (e, ) + T (e, ) log n and ^? < V (e, ) log n. 
Note that Y.i'^i^i) is at most mn since each pair {e,z) G E xV can contribute at most 1 to ^("^(^0- Using 
Cauchy's inequality, we have 



i 

< ( y/£a>(ei)+£*I'(e/) logn +Y^£v(e,) logn 1 Vm 



< (y^rrP- + mnlogn + ^Jmn\ogn)^/m 

< {m + l^Jmn\ogn)\/m 



16 



Analogously, we can show that where the sum is over all those e, whose last step was performed 

in mode (iii) is at most 0{{m+ ^Jmn\ogn)^/m). Thus S2 is 0{{m + ^/mnlogn)^/m). Since y^mnlogn < 
{m + n log n)/2 (geometric mean is at most the arithmetic mean), we have ^2 is O ( (m + n log n) ^Jm) . I 



Conclusions. We considered the problem of maintaining the topological order of a directed acyclic graph 
on n vertices under an online edge insertion sequence of m edges. This problem has been well-studied and 
the previous best upper bound for this problem was 0(min{m^/^log«, m^/^ + n^logn, n^'^^}). Here we 
showed an improved upper bound of 0(min(n^/-^, {m-Vn\ogri)\fm)) for this problem. 

Acknowledgments. We are grateful to Deepak Ajwani and Tobias Friedrich for their helpful feedback. 

References 

[1] D. Ajwani, T. Friedrich, and U. Meyer. An algorithm for online topological ordering. In 

Proceedings of the 10th Scandinavian Workshop on Algorithm Theory, LNCS 4059: 53-64, 2006. 
(Longer version in http://arxiv.org/abs/cs/0602073). 

[2] B. Alpern, R. Hoover, B. K. Rosen, R F. Sweeney, and F. K. Zadeck. Incremental evaluation of com- 
putational circuits. In Proceedings of the first annual ACM-SIAM Symposium on Discrete Algorithms: 
32-42, 1990. 

[3] Two simplified algorithms for maintaining order in a Ust. In Proceedings of the 10th Annual European 
Symposium on Algorithms: 152-164, 2002. 

[4] T. Cormen, C. Leiserson, and R. Rivest. Introduction to Algorithms. The MIT Press, Cambridge, MA, 
1989. 

[5] P. Dietz and D. Sleator Two algorithms for maintaining order in a list. In Proceedings of the 19th 
Annual ACM Conference on Theory of Computing: 365-372, 1987. 

[6] I. Katriel and H. L. Bodlaender. Online topological ordering. In Proceedings of the 16th ACM-SIAM 
Symposium on Discrete Algorithms: 443-450, 2005. 

[7] A. Marchetti-Spaccamela, U. Naiini, and H. Rohnert. Maintaining a topological order under edge 
insertions. Information Processing Letter>s,59(l):53:58,1996. 

[8] A. Marchetti-Spaccamela, U. Nanni, and H. Rohnert. On-line graph algorithms for incremental com- 
pilation. In Proceedings of the International Workshop on Graph-Theoretic Concepts in Computer 
Science, LNCS 790: 70-86,1993. 

[9] Stephen M. Omohundro, Chu-Cheow Lim, and Jeff Bilmes. The sather language complier/debugger 
implementation. Technical Report TR-92-017, International Computer Science Institute, Berkeley, 
1992. 

[10] D.J.Pearce and P.H.J. Kelly. A dynamic algorithm for topologically sorting directed acyclic graphs. In 
Proceedings of the Workshop on Efficient and Experimental Algorithms, LNCS 3059: 383-398, 2004. 

[11] D.J.Pearce, P.H.J. Kelly, and C. Hankin. Online cycle detection and difference propagation for pointer 
analysis. In Proceedings of the third international IEEE Workshop on Source Code Analysis and Ma- 
nipulation, 2003. 

[12] G. RamaUngam and T. W. Reps. On competitive on-line algorithms for the dynamic priority-ordering 
problem. Information Processing Letters, 51:155-161, 1994. 



17 



